NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Machine Unlearning via Simulated Oracle Matching

Georgiev, Kristian; Rinberg, Roy; Park, Sung Min; Garg, Shivam; Ilyas, Andrew; Madry, Aleksander; Neel, Seth (July 2024, ICML GenLaw Workshop (https://www.genlaw.org/2024-icml))

Machine unlearning---efficiently removing the effect of a small "forget set" of training data on a pre-trained machine learning model---has recently attracted significant research interest. Despite this interest, however, recent work shows that existing machine unlearning techniques do not hold up to thorough evaluation in non-convex settings. In this work, we introduce a new machine unlearning technique that exhibits strong empirical performance even in such challenging settings. Our starting point is the perspective that the goal of unlearning is to produce a model whose outputs are statistically indistinguishable from those of a model re-trained on all but the forget set. This perspective naturally suggests a reduction from the unlearning problem to that of *data attribution, where the goal is to predict the effect of changing the training set on a model's outputs. Thus motivated, we propose the following meta-algorithm, which we call Datamodel Matching (DMM): given a trained model, we (a) use data attribution to predict the output of the model if it were re-trained on all but the forget set points; then (b) fine-tune the pre-trained model to match these predicted outputs. In a simple convex setting, we show how this approach provably outperforms a variety of iterative unlearning algorithms. Empirically, we use a combination of existing evaluations and a new metric based on the KL-divergence to show that even in non-convex settings, DMM achieves strong unlearning performance relative to existing algorithms. An added benefit of DMM is that it is a meta-algorithm, in the sense that future advances in data attribution translate directly into better unlearning algorithms, pointing to a clear direction for future progress in unlearning.
more » « less
Full Text Available
Combining Diverse Feature Priors

Jain, Saachi; Tsipras, Dimitris; Madry, Aleksander (January 2022, Proceedings of the 39th International Conference on Machine Learning)

Full Text Available
Certified Patch Robustness via Smoothed Vision Transformers

Salman, Hadi; Jain, Saachi; Wong, Eric; Madry, Aleksander (January 2022, Proceedings of Computer Vision and Pattern Recognition Conference)

Full Text Available
Datamodels: Predicting Predictions from Training Data

Ilyas, Andrew; Park, Sung Min; Engstrom, Logan; Leclerc, Guillaume; Madry, Aleksander (January 2022, Proceedings of the 39th International Conference on Machine Learning)

Full Text Available
Missingness Bias in Model Debugging

Jain, Saachi; Salman, Hadi; Wong, Eric; Zhang, Pengchuan; Vineet, Vibhav; Vemprala, Sai; Madry, Aleksander (January 2022, International Conference on Learning Representations)

Full Text Available
Adversarially trained neural representations may already be as robust as corresponding biological neural representations

Guo, Chong; Lee, Michael J.; Leclerc, Guillaume; Dapello, Joel; Rao, Yug; Madry, Aleksander; DiCarlo, James J. (January 2022, Proceedings of the 38th International Conference on Machine Learning)

Full Text Available
Leveraging Sparse Linear Layers for Debuggable Deep Networks

Wong, Eric; Santurkar, Shibani; Madry, Aleksander (January 2021, Proceedings of the 38th International Conference on Machine Learning)
null (Ed.)
Full Text Available
BREEDS: Benchmarks for Subpopulation Shift

Santurkar, Shibani; Tsipras, Dimitris; Madry, Aleksander (January 2021, International Conference on Learning Representations)
null (Ed.)
Full Text Available
Circulation Control for Faster Minimum Cost Flow in Unit-Capacity Graphs

Axiotis, Kyriakos; Madry, Aleksander; Vladu, Adrian (November 2020, IEEE Symposium on Foundations of Computer Science)
null (Ed.)
We present an m^{4/3+o(1)} log W -time algorithm for solving the minimum cost flow problem in graphs with unit capacity, where W is the maximum absolute value of any edge weight. For sparse graphs, this improves over the best known running time for this problem and, by well-known reductions, also implies improved running times for the shortest path problem with negative weights, minimum cost bipartite b-matching when |b|_1 = O(m), and recovers the running time of the currently fastest algorithm for maximum flow in graphs with unit capacities (Liu-Sidford, 2020). Our algorithm relies on developing an interior point method–based framework which acts on the space of circulations in the underlying graph. From the combinatorial point of view, this framework can be viewed as iteratively improving the cost of a suboptimal solution by pushing flow around circulations. These circulations are derived by computing a regularized version of the standard Newton step, which is partially inspired by previous work on the unit-capacity maximum flow problem (Liu-Sidford, 2019), and subsequently refined based on the very recent progress on this problem (Liu-Sidford, 2020). The resulting step problem can then be computed efficiently using the recent work on l_p-norm minimizing flows (Kyng-Peng-Sachdeva-Wang, 2019). We obtain our faster algorithm by combining this new step primitive with a customized preconditioning method, which aims to ensure that the graph on which these circulations are computed has sufficiently large conductance.
more » « less
Full Text Available
Circulation Control for Faster Minimum Cost Flow in Unit-Capacity Graphs

Axiotis, Kyriakos; Madry, Aleksander; Vladu, Adrian (November 2020, IEEE Symposium on Foundations of Computer Science)
null (Ed.)
We present an $$m^{4/3}+o(1) \log W$$ -time algorithm for solving the minimum cost flow problem in graphs with unit capacity, where W is the maximum absolute value of any edge weight. For sparse graphs, this improves over the best known running time for this problem and, by well-known reductions, also implies improved running times for the shortest path problem with negative weights, minimum cost bipartite $$b$$-matching when $$\|b\|_1 = O(m)$$, and recovers the running time of the currently fastest algorithm for maximum flow in graphs with unit capacities (Liu-Sidford, 2020). Our algorithm relies on developing an interior point method–based framework which acts on the space of circulations in the underlying graph. From the combinatorial point of view, this framework can be viewed as iteratively improving the cost of a suboptimal solution by pushing flow around circulations. These circulations are derived by computing a regularized version of the standard Newton step, which is partially inspired by previous work on the unit-capacity maximum flow problem (Liu-Sidford, 2019), and subsequently refined based on the very re- cent progress on this problem (Liu-Sidford, 2020). The resulting step problem can then be computed efficiently using the recent work on $$l_p$$-norm minimizing flows (Kyng-Peng-Sachdeva- Wang, 2019). We obtain our faster algorithm by combining this new step primitive with a customized preconditioning method, which aims to ensure that the graph on which these circulations are computed has sufficiently large conductance.
more » « less
Full Text Available

« Prev Next »

Search for: All records